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Abstract. This short note demonstrates how one can define a transformation of a non-zero sum game into 
a zero sum, so that the optimal mixed strategy achieving equilibrium always exists. The transformation is 
equivalent to introduction of a passive player into a game (a player with a singleton set of pure strategies), 
whose payoff depends on the actions of the active players, and it is justified by the law of conservation of utility 
in a game. In a transformed game, each participant plays against all other players, including the passive player. 
The advantage of this approach is that the transformed game is zero-sum and has an equilibrium solution. The 
optimal strategy and the value of the new game, however, can be different from strategies that are rational in 
the original game. We demonstrate the principle using the Prisoner's Dilemma example. 
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Let X and Y be a dual pair of ordered linear spaces with respect to bilinear form (■,■) : X x Y — >M.. li x ^ X 
is a utility function x : fi — > K, and p G Y, is a probability measure p : fl — > [0, 1], then the expected utility is: 

E p {x} = (x,p) 



• Consider a game such that the space of outcomes of the game is fl :— Q\ x • • • x fi m , where fli is the set of pure 
strategies of ith player, and Xi : Q — > R are the utility functions of the players. In this case, the game is zero-sum 
if and only if 

X\ + • • • + x rn = 
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If pi, . . . ,p m £ Y are mixed strategies, then 
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sup inf E plX ... XPm {a;i} < inf supE plX ... xp2 {xi} 
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The famous Min-Max theorem pQ states that in zero-sum games, there always exists a mixed strategy p\ X • • • X p m , 
called a solution, such that the above holds with equality, and the common value is called the value of the game. 

One of the problems in game theory is the existence of a solution to a non-zero sum game. Let us consider a 
game, such that 

xi H h x m ^ 

If the utility functions add up to a constant function, so that J2 x i G Kl : = 6 X : f3 6 E}, then one can add a 
constant function x :— —-^^2xi to each Xi so that the new utilities Xi = Xi + Xq are zero-sum. We argue that the 
same can be done in the case when xq is not a constant function. Thus, we define a bijection T : X — > X by 
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T(x) := X + Xq = X > Xi 
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It is easy to see that the new utility functions are zero sum: 
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xi H h x m = ^(xi + x ) = ^ Xi + mx = )Xi-}^Xi = 

i—1 i—1 i—1 i—1 

Notice also that the new utility functions are proportional to the differences between the original utility functions 
and the sum of utilities of other players 

Xi = T{xi) = Xi - — Xt — — f (to - l)xi - : 
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In zero-sum games, Xj = Xi (i.e. T(x) = x), because xq = 0, or equivalently the utility functions differ by some 
constant function x, — Xi G Ml. Our transformation applies also to the case when these differences are not constant 
functions. 

The interpretation of adding utility xo to X, is as follows. We can extend Q — n^Li ^» 1° f2 := J?o x J2, where 
i?o := {0} is a singleton set so that Q = Qq x fl = Q (because a singleton set plays the role of a unit with respect 
to multiplication of sets). The singleton set f2o = {0} represents a player with only one pure strategy, and we refer 
to it as a passive player. In another interpretation, f2 may represent a player, whose pure strategy has already 
been chosen. The payoff ttixq : Q — > M to this player, however, may be non-constant, and depend on strategies of 
the other active players (whose pure strategies have not yet been chosen). We argue that without including f2 and 

x into the representation, the total utility of active players X\ + h x m = — mx is not constant, and therefore 

such a representation contradicts the conservation law of utility in a game: 

xH h x m e Ml 

We argue that games, in which the above law is broken, such as the non-zero sum games, have incomplete repre- 
sentation whereby some passive player (or players) has not been taken into account. The complete representation 
is achieved by transformation T(X) = X + x , and the transformed game is zero sum. A solution to such a game 
always exists, and its value is 

Epi x-xp m {^i} = lEpj x-xp m { x i + ^o} 

If Epj x---xp m {xo} = (e.g. if xo = 0), then this value equals the value of the original game. 

Example 1 (Prisonner's Dilemma). Let us consider the classical example of two-person game, when fl = fi\ x J? 2 , 
and each player (the prisoner) has two pure strategies fli = {0, 1} — cooperate (ui = 0) or defect (ui = 1). The 
payoff to each player is given by the utility function Xi : fi — > R, which can be written in a 2 x 2-matrix form 

, , /-.6-lfA t 
xi = (Xuj) = I (J _5 ) > ^ = x[ 

The classical 'rational' solution to this game is for each player to use strategy oj = (cooperate), so that Pi(0) = 1, 
and the value of the game is 

Eoxo{xi} = -.6 

However, when human participants play this game, they usually cooperate or defect with almost equal probability 
so that the observed strategies are dramatically different from pi(0) = 1. Note that for pi(0) = .5, the expected 
payoff to each player becomes significantly lower 

E. 5 x.5{^} = -3.9 

This 'irrational' behaviour presents a paradox that defied many attempts to explain it. 
Observe that the game described is not zero-sum, because 

(xuj) + (x 2ij ) = ^ J'jq => xi + x 2 £ Ml 

so that in fact the utilities x\ and x 2 do not differ by any constant function. Let us now introduce the passive player 
J?o = {0} (the detective), whose payoff is 2xq = — (xi + x 2 ), and so xq is 



Xo = (XtHj) = ~ ( Uj) : lr-lij) 
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Thus, the payoff to the detective depends on the strategies of the prisoners, and it is maximised if at least one of 
the prisoners defects. The utilities of the prisoners are transformed T(x) = x + xq to 

xi = {xuj) = , x 2 = -xi 

In this representation, the game becomes zero-sum, and it has solution pi(0) — .5 and the value E.5 X . 5 {xi} = 
^.5x.b{xi + x } = —3.9 + 3.9 = 0. This expected payoff to each player is lower than that of the strategy Pi(0) = 1, 
but it is independent of the decision of the other player to defect. 
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